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(57) Abstract 

A method of detecting a polymorphism at a polymorphic site in the genome of an individual is described. The method involves 
amplifying a portion of a genomic DNA sample by polymerase chain reaction to produce an amplified segment that contains the polymorphic 
site, desalting the amplified segment, and determining the mass of the amplified segment by electrospray ionization mass spectrometry. 
Comparison of the mass of the amplified segment to a reference mass permits detection of the presence or absence of the polymorphism. 
Methods for heterozygosity and disease inheritance by mass spectrometry are also described. The figure shows the molecular weight 
transform spectrum obtained from the mass-to-charge spectrum of a 53-base oligonucleotide. 
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RAPID, ACCURATE IDENTIFICATION OF DNA SEQUENCE 
VARIANTS BY ELECTROSPRAY MASS SPECTROMETRY 



CROSS REFERENCE TO RELATED APPLICATIONS 
This application claims the benefit of U.S. 
Provisional Application No. 60/019,702, filed June 10, 
1996. 

BACKGROUND OF THE INVENTION 
This invention relates to detection of 
differences between nucleic acids. More particularly, 
the invention relates to rapid and accurate detection 
of differences between selected nucleic acids by 
electrospray mass spectrometry. 

As the Human Genome Project matures, the demand 
for detecting genomic sequence differences is 
increasing. Many of these sequence differences, or 
polymorphisms, lead to altered gene expression, i.e. 
mutations. Analysis of the nucleotide composition for 
a polymorphism on both homologous chromosomes in the 
genome of an individual provides information relevant 
to detecting the inheritance of genetic diseases and 
disorders. Many disease mutations are thought to be 
caused by such small differences, either single base 
substitutions or single base insertions or deletions. 
Using polymerase chain reaction (PCR) technology, a 
segment of DNA containing a polymorphism can be 
amplified from each chromosome and then analyzed to 
determine nucleotide composition. Current methods of 
analysis rely heavily on electrophoretic methods of 
detection, where relative mobilities of DNA fragments 
are compared to extrapolate sequence differences. 
E.g., M. Orita et al . , Detection of Polymorphisms of 
Human DNA by Gel Electrophoresis as Single- strand 
Conformation Polymorphisms, 8 6 Proc. Nat f 1 Acad. Sci. 
USA 2766-70 (1989); R.M. Meyers et al . , Detection of 
Single Base Substitutions by Ribonuclease Cleavage at 
Mismatches in RNA:DNA Duplexes, 230 Science 1242-49 
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(1985); M.D. Traystman et al . , Use of Denaturing 
Gradient Gel Electrophoresis to Detect Point Mutations 
in the Factor VIII Gene, 6 Genomics 293-301 (1990); S. 
Rust et al., Mutagenically Separated PCR (MS-PCR) : A 
Highly Specific One-Step Procedure for Easy Mutation 
Detection, 21 Nucleic Acids Res. 3623-29 (1993); 
P. A.M. Roest et al . , Protein Trucation Test (PTT) for 
Rapid Detection of Translation- termination Mutations, 
2 Hum. Mol. Genet. 1719-21 (1993). 

The most common technique for analysis of 
polymorphisms is nucleotide sequencing, particularly 
by the well-known dideoxynucleotide ("dideoxy") method. 
F. Sanger et al . , DNA Sequencing with Chain- 
terminating Inhibitors, 74 Proc. Nat ' 1 Acad. Sci . USA 
5463 (1977) . Sequencing a sample in which the locus 
for a polymorphism is identical for both chromosomes 

(a homozygote) generally provides a clean signal of 
the base composition for the polymorphic locus. On 
the other hand, if the nucleotide composition of a 
polymorphic locus is different on each chromosome (a 
heterozygote) , the sequencing signal is complex. In 
base- substitution polymorphisms, sequencing generates 
a multiple base signal for the polymorphic position. 
In base-deletion or base- insertion polymorphisms, 
sequencing generates multiple base signals starting at 
the polymorphic locus and propagating throughout the 
rest of the sequence. Although accurate and reliable, 
dideoxy sequencing is complex and labor intensive 
requiring many hours to analyze a sample. 

To simplifly the electrophoretic analysis of 
polymorphisms, methods such as oligonucleotide 
ligation assay (OLA) , mismatch PCR, and engineered 
restriction enzyme site analysis were developed. In 
OLA, two primers are designed to anneal adjacently on 
a genomic, amplified template. Subsequent 
thermocycling in the presence of a thermostable ligase 
covalently links the primers if they anneal to the 
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template without a gap between them and without a 
mismatch at the ends to be joined. This technique, 
and other similar techniques, is simple and about 5- 
times faster than sequencing, but it suffers from the 
drawback of requiring sequence-specific optimization 
to reduce false positives or false negatives. Another 
technique, sequence -based hybridization (SBH) , where 
samples are analyzed based on their differential 
hybridization to stationary oligonucleotide arrays, 
requires less optimization of the sample preparation, 
but may require rearrangement of the oligonucleotide 
array to generate practical differential 
hybridization. The SBH technique is about 20- times 
faster than sequencing and is potentially simpler and 
thus more robust than the other techniques mentioned 
above. The speed of the electrophoretic and SBH 
techniques for analyzing polymorphisms appears to be 
approaching a minimum of minutes . To achieve 
polymorphism analysis methods that can be carried out 
in seconds, however, new methods of analysis are 
needed. 

In view of the foregoing, it will be appreciated 
that providing a method for rapid and accurate 
detection and identification of sequence polymorphisms 
would be a significant advancement in the art. 

BRIEF SUMMARY OF THE INVENTION 
It is an object of the present invention to 

provide a rapid and accurate method of detecting 

polymorphisms in nucleic acids. 

It is also an object of the invention to provide 

a method of detecting heterozygosity at a selected 

site in the genome of an individual. 

It is another object of the invention to provide 

a method of detecting a genetic disorder or disease 

associated with a selected polymorphism in the genome 

of an individual . 
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These and other objects can be achieved providing 
a method of detecting inheritance by an individual of 
a selected genetic disease or disorder associated with 
a polymorphic site comprising a base substitution or a 
5 base deletion or insertion, comprising the steps of: 

(a) collecting a genomic DNA sample from the 
individual ; 

(b) amplifying a selected portion of the genomic 
DNA sample by polymerase chain reaction to produce an 

10 amplified DNA segment containing the polymorphic site 

and having a size such that a difference in mass 
between the amplified DNA segment and a corresponding 
portion of a reference DNA sample can be resolved by 
electrospray ionization mass spectrometry; 

15 (c) desalting the amplified DNA segment to 

produce a desalted amplified DNA segment; 

(d) determining the mass of said desalted 
amplified DNA segment by electrospray ionization mass 
spectrometry; and 

20 (e) comparing the mass to a reference mass 

determined for the corresponding portion of the 
reference DNA sample, wherein a difference between the 
mass and the reference mass indicates inheritance of 
the genetic disease or disorder. 

25 While the size of the amplified DNA segment is to 

be limited only by functionality in detecting mass 
differences between the mass of the amplified DNA 
segment and the reference mass, the size of the 
amplified DNA segment is preferably no greater than 

30 about 55 base pairs, more preferably no greater than 

about 50 base pairs, and most preferably no greater 
than about 45 base pairs . 

Various methods of desalting the amplified DNA 
can be used, as are known in the art. Preferred 

35 methods of desalting the amplified DNA include high 

pressure liquid chromatography, molecular weight cut- 
off spin filtration, and alcohol precipitation. 
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A method of detecting heterozygosity at a 
polymorphic site in the genome of an individual, 
wherein the polymorphic site comprises a site at which 
a base substitution or base deletion or insertion 
5 occurs in a mutant allele as compared to a wild type 

allele, comprises the steps of: 

(a) collecting a genomic DNA sample from the 
individual ; 

(b) amplifying a selected portion of the genomic 
10 DNA sample by polymerase chain reaction to produce 

first and second amplified DNA segments representing 
corresponding portions of both homologous chromosomes 
containing the polymorphic site, wherein the first and 
second amplified DNA segments each have a size such 
15 that a difference in mass between the first and second 

amplified DNA segments can be resolved by electrospray 
ionization mass spectrometry; 

(c) desalting the first and second amplified DNA 
segments to produce respective first and second 

2 0 desalted amplified DNA segments; 

(d) determining the masses of the first and 
second desalted amplified DNA segments by electrospray 
ionization mass spectrometry; and 

(e) comparing the masses of the first and second 
25 desalted amplified DNA segments, wherein a difference 

in the masses indicates heterozygosity at the 
polymorphic site. 

A method of detecting a polymorphism at a 
polymorphic site comprising a base substitution or a 
30 base deletion or insertion, comprises the steps of: 

(a) collecting a DNA sample to be tested; 

(b) amplifying a selected portion of the DNA 
sample by polymerase chain reaction to produce an 
amplified DNA segment containing the polymorphic site 

35 and having a size such that a difference in mass 

between the amplified DNA segment and a corresponding 
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portion of a reference DNA sample can be resolved by 
electrospray ionization mass spectrometry; 

(c) desalting the amplified DNA segment to 
produce a desalted amplified DNA segment; 
5 (d) determining the mass of the desalted 

amplified DNA segment by electrospray ionization mass 
spectrometry; and 

(e) comparing the mass to a reference mass 
determined for the corresponding portion of the 
10 reference DNA sample, wherein a difference between the 

mass and the reference mass indicates the presence of 
the polymorphism. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 
FIG. 1 shows a mass- to -charge (m/z) spectrum of a 
15 53-base oligonucleotide (SEQ ID N0:1), wherein the 

values in brackets denote measured m/z values and the 
superscripts denote calculated charges. 

FIG. 2 shows a molecular weight transform 
spectrum obtained from the mass- to- charge spectrum of 
20 FIG. 1. 

FIG. 3 shows an m/z spectrum of PCR-amplif ied 

DNA. 

FIG. 4 shows a molecular weight transform 
spectrum obtained from the m/z spectrum of FIG. 3. 
25 FIG. 5 shows (A) a molecular weight transform 

spectrum obtained from PCR-amplif ied DNA wherein the 
template genomic DNA displayed homozygosity at the 
polymorphic site, and (B) a molecular weight transform 
spectrum obtained from PCR-amplif ied DNA wherein the 
30 template genomic DNA displayed heterozygosity at the 

polymorphic site . 

FIG. 6 shows molecular weight transform spectra 
of pairwise mixtures of synthetic 53 -residue 
oligonucleotides, wherein the paired oligonucleotides 
35 differ only by a single base, as follows: (A) C and A; 

(B) T and G; (C) A and G; (D) C and T; (E) C and G; 
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(F) T and A; (G) theoretical molecular weights of the 
four oligonucleotides. 

FIG. 7 shows molecular weight transform spectra 
showing detection of alleles for a C to T base 
5 substitution polymorphism according to the present 

invention: (A) heterozygous mother; (B) heterozygous 
father; (C) homozygous daughter 1; (D) homozygous 
daughter 2; (E) theoretical molecular weights of 
amplified DNA strands from allele 1; (F) theoretical 
10 molecular weights of amplified DNA strands from allele 

2 . 

FIG. 8 shows molecular weight transform spectra 
showing detection of alleles for a two-base deletion 
polymorphism according to the present invention: (A) 
15 heterozygous mother; (B) father homozygous for allele 

1; (C) son 1 homozygous for allele 1; (D) heterozygous 
son 2; (E) theoretical molecular weights of amplified 
DNA strands from allele 1; (F) theoretical molecular 
weights of amplified DNA strands from allele 2. 

20 DETAILED DESCRIPTION 

Before the present method for rapid and accurate 
detection and identification of sequence polymorphisms 
is disclosed and described, it is to be understood 
that this invention is not limited to the particular 

25 configurations, process steps, and materials disclosed 

herein as such configurations, process steps, and 
materials may vary somewhat. It is also to be 
understood that the terminology employed herein is 
used for the purpose of describing particular 

30 embodiments only and is not intended to be limiting 

since the scope of the present invention will be 
limited only by the appended claims and equivalents 
thereof . 

It must be noted that, as used in this 
35 specification and the appended claims, the singular 

forms "a," "an, 11 and "the" include plural referents 
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unless the context clearly dictates otherwise. Thus, 
for example, reference to "an amplified segment" 
includes reference to two or more of such amplified 
segments, reference to "a polymerase" includes 
reference to a mixture of two or more of such 
polymerases, and reference to "a primer" includes 
reference to two or more of such primers . 

In describing and claiming the present invention, 
the following terminology will be used in accordance 
with the definitions set out below. 

As used herein, "genomic DNA" or similar terms 
includes reference to transcription products of such 
genomic DNA. It is well known in the art that RNA can 
be readily copied as cDNA, which can be amplified by 
PCR. Thus, amplification of genomic DNA and 
amplification of cDNA obtained from RNA are considered 
equivalent . 

Mass spectrometry provides a new method of DNA 
sequence analysis because differences in sequence 
composition can be measured from mass differences. 
Mass spectrometry is capable of very rapid sample 
analysis. For example, mass analysis of sequencing 
reaction products by electrospray ionization Fourier 
transform ion cyclotron resonance (ESI-FTICR) achieves 
mass analysis detection of all the reaction products 
with a 20 -second sampling of the ion beam. Even 
though most mass spectrometric detectors are capable 
of determining the mass of components in a sample in 
less than a second, the rate for sample analysis is 
limited to about 15 minutes per sample due to other 
factors, such as signal averaging and sample handling. 
Matrix-assisted laser desorption ionization time-of- 
f light mass spectrometry (MALDI-TOF) has been 
demonstrated as a viable technique for analysis of 
polymorphisms. L.Y. Ch'ang et al . , Detection of delta 
F508 mutation of the cystic fibrosis gene by matrix- 
assisted laser desorption/ionization mass 
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spectrometry, 9 Rapid Cornmun. Mass Spectrom. 772-7 '4 
(1995) . The resolution limit for this MALDI-TOF 
analysis, however, is approached with detecting two- 
base deletion polymorphisms in double- stranded DNA. A 
recent advance with a modified form of MALDI-TOF, 
involving delayed extraction of the ion plasma, has 
increased its mass resolution somewhat. P. Juhasz et 
al . , Applications of Delayed Extraction Matrix- 
Assisted Laser Desorption Ionization Time -of -Flight 
Mass Spectrometry to Oligonucleotide Analysis, 68 
Anal. Chem. 941-46 (1996). 

Mass spectrometric analysis has two stages, 
ionization and ion analysis. For analysis of 
polynucleotides, the ionization process employs either 
electrospray ionization (ESI) or Matrix-assisted laser 
desorption ionization (MALDI) . The typical analysis 
hardware for either of these two ionizers can include 
ion trap (IT) , time-of -flight (TOF) , quadrupole, or 
Fourier transform ion cyclotron resonance (FTICR) 
analyzers. ESI is typically interfaced with a 
quadrupole analyzer, and MALDI is generally coupled 
with a TOF analyzer. The accuracy of mass analysis 
using these methods is approximately as shown in Table 
l . 



Table 1 


Ionizer-Analyzer 


% Mass Error 


ESI -Quadrupole 


0. 01 1 


ESI-FTICR 


0. 0001 


ESI-IT 


0. 1 


MALDI-TOF 


0.03-0,1 


MALDI -FT ICR 


0.001 



At the level of a 50-mer (Mr ~ 17,000), all of 
these combinations of ionizer and analyzer should, in 
theory, be able to distinguish two molecules differing 
in mass by 15 daltons . However, in published results, 
TOF and IT analyzers both exhibit poor resolution such 
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that peaks differing by ± 20 daltons cannot be 
resolved. It has now been discovered that the peak 
resolution achieved for an ESI-quadrupole combination 
is capable of resolving 9 dalton peak differences, and 
that mass differences of 15 daltons are readily- 
discernible . 

There are six possible type of base substitution 
polymorphisms. Table 2 shows the absolute value of 
the mass differences for these six types of 
substitutions in double- stranded DNA . 





Table 2 




GC 


AT 


TA 


CG 


| GC 


0 


1 


1 


0 


AT 




0 


0 


1 


! TA 






0 


1 


CG 








0 



Table 3 shows the absolute value of the mass 
differences for these six types of substitutions in 
single- stranded DNA. 



Table 3 




G 


A 


T 


C 


G 


0 


16 


25 


40 


A 




0 


9 


24 


T 






0 


15 


C 








0 



Thus, for double- stranded DNA, the mass 
differences for these six types of base substitutions 
are only 0 or 1 dalton. For single -stranded DNA, 
however, the mass differences range from 9-40 daltons. 
Therefore, for base- substitution DNA polymorphisms to 
be analyzed by their mass differences, the samples 
must be denatured to single strands. Moreover, the 
separate mass analysis of each of the two strands of 
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double- stranded DNA provides two independent 
measurements of allelic composition. ESI-quadrupole 
analysis is ideally suited for analysis of base 
substitutions because it is inherently single-strand 
5 analysis. For a DNA sample to generate practical mass 

data by ESI-quadrupole analysis, it must be in a 
partially organic solvent that is virtually free of 
metallic, nonvolatile salts, e.g. methanol or 
acetonitrile . Thus, the low salt, organic nature of 

10 the solvent carrying a PGR product will generate, upon 

ionization, primarily single-stranded DNA ions. 

An ESI -MS analyzer is capable of measuring very 
large ions because, as for all mass analyzers, the 
property measured is mass to charge ratio (m/z) , and 

15 not mass directly. Analysis of nucleic acids is 

generally performed in negative ion mode. As the 
nucleic acid enters the gas phase, the phosphate 
backbone oxyanions, all of which are charged in 
solution, are partially neutralized by proton 

2 0 adduction. The amount of neutralization of oxyanion 

charge varies between the molecules and thus results 
in a population of molecules with varying charge 
states by integral increments. Thus, mass analysis of 
a compound by mass spectrometry generates a series of 

25 m/z peaks, where the charge state of the peaks 

increases as the value of measured m/z decreases. For 
example, ESI -MS analysis of a synthetic 

oligonucleotide containing 53 nucleotide residues (SEQ 

ID N0:1) generates a series of m/z peaks (FIG. 1) . 
30 For a given m/z peak shown in FIG. 1, the value inside 

the brackets is the measured m/z, and the superscript 

value is the calculated charge. 

The mass of a compound can be calculated from one 

of the m/z peaks with the following formula: 
35 M = (m/z)Z + Z, 

where M is analyte mass, m/z is charge- to-mass ratio, 

and Z is the value of the charge. This formula is 
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used to calculate the charge from the measured value 
of two m/z peaks and their integral relationship of 
charge. Once charge is determined, the mass of the 
compound can be calculated from each m/z peak, and 
5 thus provide multiple measurements of molecular 

weight, as shown in Table 4. The average of the 
masses calculated from each m/z peak determines mass 
accuracy to within 0.01% mass error. Using the same 
formula, a computer can automatically transform an m/z 

10 spectrum into a molecular weight spectrum according to 

the Fenn method, J.B. Fenn et al . , Elect rospray 
Ionization for Mass Spectrometry of Large 
Biomolecules, 246 Science 64-70 (1989) , hereby 
incorporated by reference, and thus render a molecular 

15 weight peak for each related m/z series, as shown in 

FIG. 2. 



Table 4 




m/z Peak 


Mass Estimate 


772.9 


21 


16,252 


811.6 


20 


16,252 


854. 3 


19 


16,251 


901. 9 


18 


16,252 


954.9 


17 


16,250 I 


1014. 6 


16 


16,250 


1082.5 


15 


16,253 


1159.9 


14 


16,253 


Average ± S.D. 




16,251.6 ± 1.12 



The presence of salt, e.g. Na 1 or K + , in the 
analyte solution competes with protons for 
neutralizing phosphate backbone charges. This salt 
adduction generates additional peaks in the ESI -MS 
30 spectrum. For example, in FIG. 2, the two peaks to 

the right of the main peak are salt adducts of Na + and 
2 Na\ If an ESI-MS analysis solution is inadequately 
desalted, then salt adduction peaks become the primary 
species observed. This is undesirable for at least 
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two reasons. First, the intensity of the signal is 
decreased by distributing the analyte mass over 
multiple peaks with varying amounts of salt adduction. 
Second, the mass spectrum of a complex mixture is 
5 confused by a much higher degree of m/z peak overlap. 

To analyze polymorphisms in genomic material by 
ESI-quadrupole mass spectrometry, it is advantageous 
to use PCR technology. PCR primers are designed to 
flank a polymorphic locus such that the nucleotide 

10 base pairs involved in the polymorphism are not 

defined by the primers. Preferably, the distance 
between the primers is about 1-5 nucleotides. PCR 
amplification generates a dsDNA product from each 
chromosome of a homologous pair. In individuals 

15 homozygous for the polymorphic locus, where the 

specific nucleotide composition at the locus (allele) 
is the same for both homologous chromosomes, the PCR 
amplification will generate one type of dsDNA product. 
In heterozygous individuals, where the sequence 

20 composition at the locus is different for each 

chromosome (two different alleles) , amplification 
generates two slightly different dsDNA products. 
Using ESI-quadrupole mass spectrometry, the molecule 
weight of each of the strands of a dsDNA PCR product 

25 can be measured. Each strand can be assigned to an 

allele type by comparison of the observed molecular 
weight to the expected weight of the allele types. 

ESI-MS provides several advantages over methods 
known in the art. For example, the ESI-MS technique 

30 is highly specific. The small size of amplified 

products enables observation of very narrow windows 
into the genome. Only the region of DNA between the 
DNA primers, i.e. the window, represents the genomic 
DNA. In the PCR amplification, the primers define the 

35 polymerization of their opposite strand base pairs. 

Hence, if the annealed primers are separated by only 
one base pair at their 3' ends, then only that base 
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pair on each homologous chromosome, out of the 3 
billion base pairs in the human genome, is viewed in 
the window. Larger windows could be designed, but may 
present problems. First, a larger window might 
5 incorporate multiple polymorphic loci, which could 

confuse or misidentify diagnosis. Second, a larger 
window increases the mass measured, but does not 
change the mass difference between alleles. Thus, 
larger mass amplification products decrease resolution 

10 of alleles in a polymorphism. Limiting the PCR 

amplification window to including only the bases 
involved in a putative polymorphism prevents 
interference from other nearby polymorphisms and 
maximizes resolution between alleles. 

15 Unlike electrophoretic techniques, the ESI-MS 

technique does not require empirical optimization of 
analysis conditions to elucidate polymorphy. Once the 
PCR amplification has been optimized for specificity 
and yield, the preparation of samples for mass 

20 analysis is the same for any amplified product. As a 

result, successful ESI -MS analysis appears to be 
dependent primarily on sample cleanup and to be 
independent of nucleotide composition. 
Electrophoretic techniques such as SSCP, however, 

25 require extensive exploration of running conditions to 

detect specific polymorphisms. Mismatch PCR and 
polymorphism selective restriction site engineering 
both are site-specific, thus must be re-optimized for 
each locus analyzed. 

30 The resolution limit of ESI-MS analysis for 

polymorphism detection is approached in T-to-A or A- 
to-T base-substitution polymorphisms. Resolution of 
the 9 dalton difference between such alleles, see 
Example 3 below, demonstrates that two peaks are 

35 resolvable as different alleles on a 53 -nucleotide 

substrate using a quadrupole mass detector, but that 
there is considerable overlap of the peaks under such 
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conditions . Using a Fourier transform ion cyclotron 
resonance (FTICR) mass detector can result in about 
100-fold increase in resolution of the alleles, thus 
making easy detection of alleles. Enhanced resolution 
5 of a T-to-A base- substitution polymorphism can be 

achieved on a quadrupole detector by substituting 
deoxyuridine for deoxythymidine in the PCR reaction 
protocol. This modification results in a 23-dalton 
separation between the alleles, thus enabling a 

10 quadrupole detector to readily distinguish the alleles 

of all types of base substitution polymorphisms in a 
53 -nucleotide substrate . 

ESI -MS is complementary to current technology by 
enabling confirmation of putative polymorphic loci and 

15 then rapid screening of sample sets. The ESI-MS 

technique improves reliability of diagnosis because it 
directly measures the mass of the alleles involved in 
a polymorphism, not their relative mobilities. ESI-MS 
analysis appears well suited for detecting base- 

20 substitution polymorphisms or base deletion/insertion 

polymorphisms, which are difficult to detect by 
current methods. Many disease genes result from 
nonsense, missense, and frame-shifting mutations. 
These mutations, which are primarily caused by base 

25 substitutions, or one- or two-base deletions or 

insertions, are readily detectable by ESI-MS analysis. 

ESI-MS can be used in the fields of disease gene 
detection, genotyping, tissue typing, and DNA 
forensics. ESI-MS analysis of a set of base 

3 0 substitution polymorphisms can be use to uniquely 

identify an individual, and thus may be a useful 
forensic tool. ESI-MS can also be used for tissue 
typing transplant or graft matching or for identifying 
pathogens, such as bacteria and viruses. This 

35 technique is also amenable to cancer dianostics by 

detecting the presence of a somatically mutated allele 
in either tissue biopsies or in blood. ESI-MS 
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analysis can be used as a technique to confirm the 
content of polymorphisms by detecting the expected 
alleles and then to rapidly screen for the presence of 
the alleles in genomes . 

5 Example 1 

Genomic templates were prepared by 
phenol /chloroform extraction of blood or of biopsy 
material according to methods well known in the art, 
e.g. J. Sambrook et al . , Molecular Cloning: A 

10 Laboratory Manual (2d ed. , 1989), hereby incorporated 

by reference. Oligonucleotides were either purchased 
from Genset or National Biosciences, or were 
synthesized according to methods well known in the 
art, e.g. S.A. Narang et al., 6 8 Meth. Enzymol . 90 

15 (1979); E.L. Brown et al . , 68 Meth. Enzymol. 109 

(1979); U.S. Patent No. 4,356,270; U.S. Patent No. 
4,458,066; U.S. Patent No. 4,416,988; U.S. Patent No. 
4,293,652; N.D . Sinha et al . , 24 Tetrahedron Lett. 
5843 (1983); N.D. Sinha et al . , 12 Nucl . Acids Res, 

20 4539 (1984); N.D. Sinha et al . , 15 Nucl. Acids Res. 

397 (1987); N.D. Sinha et al . , 16 Nucl. Acids Res. 319 
(1988), hereby incorporated by reference. Taq 
polymerase was purchased from either Perkin-Elmer or 
as "PCR SUPERMIX" from Gibco/BRL. PCR was carried out 

25 in either a Model 9600 or a Model 2400 Perkin-Elmer 

thermocycler according to methods well known in the 
art, e.g. U.S. Patent No. 4,683,195; U.S. Patent No. 
4,683,202; U.S. Patent No. 4,800,159; U.S. Patent No. 
4,965,18 8; PCR Technology: Principles and Applications 

30 for DNA Amplification (H. Erlich ed., Stockton Press, 

New York, 1989); PCR Protocols: A guide to Methods and 
Applications (Innis et al . eds, Academic Press, San 
Diego, Calif., 1990), hereby incorporated by 
reference. PCR amplification was generally optimized 

35 at 40 cycles as follows: annealing for 30 seconds at 

60-65°C, extension for 30 seconds at 72°C, and 
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denaturing for 10 minutes at 94°C. The primers were 
SEQ ID NO: 2 and SEQ ID NO : 3 . 

PCR product purification was performed by either 
reverse phase HPLC, molecular weight cut-off spin 
filtration, or ethanol precipitation. For most 
samples, the PCR products were isolated by reverse 
phase HPLC, lyophilized to dryness, resuspended in 
deionized water, and then ethanol precipitated. The 
supernate was then removed from the precipitate by 
aspiration, and the precipitate was lyophilized once 
again. The dry precipitate was then dissolved in 80% 
methanol, 10 mM triethylamine (TEA) , and subjected to 
quadrupole mass analysis according to methods well 
known in the art . 

Reverse phase HPLC was performed on a Waters HPLC 
with buffer A: 100 mM TEA-bicarbonate , pH 7, and 
buffer B: 100 mM TEA-bicarbonate, pH 7, and 50% 
methanol. Gradient elution was with 0 to 100% buffer 
B in 60 minutes, and the column was a 4 mm x 3 00 mm 
PRP-3 from Hamilton. Molecular weight cut-off spin 
filters were MICROCON 3 and MICROCON 10 from Amicon. 
Ethanol precipitation was performed at a final 
concentration of 70% ethanol and 0.7 M ammonium 
acetate at -20°C for greater than 1 hour. 

Samples for ESI -MS analysis were solvated in 80% 
methanol, 10 mM TEA, and then were injected into a 
Sciex QE mass spectrometer through a charged 
capillary. Ions were measured as a mass- to- charge 
ratio spectrum and then transformed into molecular 
weight spectra by the Fenn method, supra. 

FIGS. 3 and 4 show, respectively, the m/z 
spectrum and the molecular weight transform spectrum 
of PCR products amplified according to this example. 
Four major peaks were observed (FIG. 4) , which are 
further characterized in Table 5 . 
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Table 5 


j Peak 


Identity 


Predicted 
Mass 


Observed j 
Mass 


I 


SEQ ID NO: 1 


16,211 


16,211 


II 


SEQ ID NO: 4 


16, 419 


16, 419 


III 


SEQ ID NO: 5 


16, 524 


16,523 


IV 


SEQ ID NO: 6 


16, 731 


16,732 



These results show that two of the PCR products, 
peaks I and II, correspond to the sense and antisense 
strands of the predicted amplification product. The 

10 other two PCR products, peaks III and IV, correspond 

to the 3 1 -mono -adenylat ion products of peaks I and II. 
Taq polymerase is known to add such adenylate residues 
to amplification products in PCR. Table 5 also shows 
that comparison of the masses measured from the four 

15 amplification products, peaks I -IV, to the expected 

masses for such products demonstrates a mass- 
determining accuracy to within 0.01% mass error. 

Example 2 

In this example, the procedure of Example 1 was 
20 followed except that genomic templates were from an 

individual homozygous at a polymorphic site and from 
another individual heterozygous at the polymorphic 
site. SEQ ID NO : 1 illustrates a portion of the 
sequence of the sense strand of one allele of the 
25 polymorphic site, and SEQ ID NO : 7 illustrates a 

corresponding portion of the sequence of the sense 
strand sequence of the other allele at the polymorphic 
site. Thus, the two alleles differ only by a single 
base substitution. In the homozygote, where each 
30 homolog has the same allele, the mass analysis 

resulted in one peak for each of the four expected 
strands (FIG. 5A) . In the heterozygote , however, 
where one homolog has a CG base pair at the allelic 
locus and the other homolog has a TA base pair, the 
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mass analysis generated doublet peaks for each strand 
(FIG. 5B) . The molecular weight observed for each 
peak in the doublet corresponds to the expected 
molecular weight from each allele . In closer 
5 examination of the spectra, the spectrum from the 

homozygote shows a peak for the expected adenylated 
antisense strand, but the spectrum from the 
heterozygote does not appear to have an expected 
doublet at this position. The three other doublet 
10 peaks in the spectrum from the heterozygote each 

provide independent measurements of heterozygosity, 
thus the inherent redundancy of the technique enhances 
its robustness. 



Example 3 

15 In this example, oligonucleotides were 

synthesized to model all possible base substitution 
polymorphisms at the locus examined in Example 2 . 
These four oligonucleotides were identical except for 
having a different base at the polymorphic site. The 

20 four oligonucleotides are SEQ ID N0:1, SEQ ID NO: 7, 

SEQ ID NO: 8, and SEQ ID NO : 9 . Pairwise mixtures were 
made of all six possible pairs of oligonucleotides, 
and these mixtures were analyzed by ESI -MS according 
to the procedure of Example 1. FIG. 6 shows that 5 of 

25 the 6 pairs are easily resolved from each other {FIGS. 

6A-E) . The mixture of SEQ ID NO : 7 and SEQ ID NO: 8, 
having, respectively, T and A residues at the 
polymorphic site, shows partial overlap of the peaks 
(FIG. 6F) . Better resolution could be obtained by 

30 decreasing the size of the oligonucleotides, which 

would increase the percentage difference in mass 
between the oligonucleotides. 



35 



Example 4 

In this example, the procedure of Example 3 is 
followed except that SEQ ID NO: 10 is substituted for 
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SEQ ID NO:7. The 23 dalton difference in molecular 
weight between dA and dU at the polymorphic locus 
results in the mixture of SEQ ID NO: 8 and SEQ ID NO: 10 
being easily resolved. 

5 Example 5 

In this example, the utility of the present 
technique was explored for genotyping and for disease 
inheritance detection in family groups. ESI-MS 
genotyping in a four-member family for a C to T base 

10 substitution polymorphism was performed at a silent 

polymorphism locus in a candidate gene for Benign 
Neonatal Familial Convulsions (BNFC) . The procedure 
was as in Example 1 except that the primers were SEQ 
ID NO: 11 and SEQ ID NO: 12. The molecular weight 

15 spectra (FIG. 7) demonstrate detection of alleles, 

namely two alleles in the heterozygotic mother (A) and 
father (B) , and only one allele in the homozygotic 
daughters (C and D) . The ESI-MS analysis and 
assignment were performed without knowledge of which 

20 samples were homozygotic or heterozygotic. The 

assignment was confirmed by checking the sequence 
results from an automated sequencer that had 
previously been used to diagnose allele inheritance in 
the family. In comparison to the sequencing results, 

25 ESI-MS analysis demonstrates at least two advantages. 

First, ESI-MS generates data on the composition of 
each strand, sense and antisense, in the analysis of 
one sample, whereas automated sequencing requires two 
separate sequencing reactions, i.e. with a forward 

30 primer and a reverse primer, to achieve data on each 

strand. Second, ESI-MS gives positive detection of 
each allele in heterozygotes , whereas automated 
sequencing gives complex signals that can be difficult 
or impossible to interpret without performing 

35 additional experimentation. 
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Example 6 

In this example, ESI -MS technique was applied to 
disease inheritance detection. Analysis was performed 
on a family group with a defective allele for the 
5 Attenuated Polyposis Coli (APC) gene. The defective 

allele comprises a two-base deletion, which results in 
a 618 dalton difference between the PCR products of 
the alleles. FIG. 8 shows spectra obtained according 
to the procedure of Example 1, except that the primers 
10 were SEQ ID NO: 13 and SEQ ID NO : 14 . Spectra FIGS. 8A 

and 8D show the presence of peaks that are not present 
in the spectra FIGS. 8B and 8C, indicating the 
presence of the mutant allele. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: Hopkins, Chris E. 

Gesteland, Raymond F. 
5 Peiffer, Andy 

Stauffer, Dora 
Leppert , Mark 
Crain, Pamela F. 
McCloskey, James A. 

10 (ii) TITLE OF INVENTION: Rapid, Accurate 

Identification of DNA 
Sequence Variants by 
Electrospray Mass 
Spe c t rome t ry 

15 (iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Thorpe, North & Western, 

L.L.P. 

(B) STREET: 9035 South 700 East, Suite 200 
20 (C) CITY: Sandy 

(D) STATE: Utah 

(E) COUNTRY: USA 

(F) ZIP: 84070 

(v) COMPUTER READABLE FORM: 

25 (A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 

Mb storage 

(B) COMPUTER: Toshiba T215 0CDS 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: Word Perfect 7.0 

30 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

35 (A) APPLICATION NUMBER: 60/019,702 

(B) FILING DATE: 10-JUN-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Alan J. Howarth 

(B) REGISTRATION NUMBER: 36,553 

40 (C) REFERENCE /DOCKET NUMBER: T3 2 62.PCT/U- 

2212 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (801)566-6633 

(B) TELEFAX: (801)566-0750 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCTACAACAC CAGGAAGTAC GAGTGCTGCG C CG AG AT CT A 40 
CCCGGACATC ACC 53 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
CCTACAACAC CAGGAAGTAC GAGTG 2 5 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GGTGATGTCC GGGTAGATCT CGGC 24 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GGTGATGTCC GGGTAGATCT CGGCGCAGCA CTCGTACTTC 40 
CTGGTGTTGT AGG 53 

(2) INFORMATION FOR SEQ ID NO: 5: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CCTACAACAC CAGGAAGTAC GAGTGCTGCG CCGAGATCTA 4 0 

CCCGGACATC ACCA 54 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

2 0 GGTGATGTCC GGGTAGATCT CGGCGCAGCA CTCGTACTTC 40 

CTGGTGTTGT AGGA 54 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CCTACAACAC CAGGAAGTAC GAGTGCTGTG CCGAGATCTA 4 0 
30 CCCGGACATC ACC 53 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCTACAACAC CAGGAAGTAC GAGTGCTGAG CCGAGATCTA 40 
CCCGGACATC ACC 53 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

15 CCTACAACAC CAGGAAGTAC GAGTGCTGGG CCGAGATCTA 40 

CCCGGACATC ACC 53 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCTACAACAC CAGGAAGTAC GAGTGCTGUG CCGAGATCTA 40 
2 5 CCCGGACATC ACC 53 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGCGAGTGGG TCATCGTGGA 20 
(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 12: 
GTGTTGTAGG TGCCCACGGC 20 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ACGTACCTTT TTTTAAAAAA AAAAAATAGG TC 3 2 
(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

GGCTTCTTTG TCAAGATCAG CAAGAAGCA 2 9 
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CLAIMS 

We claim: 

1. A method of detecting inheritance by an 
individual of a selected genetic disease or disorder 
associated with a polymorphic site comprising a base 
substitution or a base deletion or insertion, 
comprising the steps of : 

(a) collecting a genomic DNA sample from said 
individual ; 

(b) amplifying a selected portion of said genomic 
DNA sample by polymerase chain reaction to produce an 
amplified DNA segment containing said polymorphic site 
and having a size such that a difference in mass 
between said amplified DNA segment and a corresponding 
portion of a reference DNA sample can be resolved by 
elect rospray ionization mass spectrometry; 

(c) desalting said amplified DNA segment to 
produce a desalted amplified DNA segment; 

(d) determining the mass of said desalted 
amplified DNA segment by electrospray ionization mass 
spectrometry; and 

(e) comparing said mass to a reference mass 
determined for said corresponding portion of said 
reference DNA sample, wherein a difference between 
said mass and said reference mass indicates 
inheritance of said genetic disease or disorder. 

2 . The method of claim 1 wherein said 
electrospray ionization mass spectrometry comprises 
ion detection with a quadrupole detector. 

3 . The method of claim 1 wherein said 
electrospray ionization mass spectrometry comprises 
ion detection with a Fourier transform ion cyclotron 
resonance mass detector. 



WO 97/47766 



PCT/US97/08518 



28 

4. The method of claim l wherein the size of 
the amplified DNA segment is no greater than about 55 
base pairs . 

5. The method of claim 4 wherein the size of 

5 the amplified DNA segment is no greater than about 45 

base pairs . 

6. The method of claim 1 wherein the desalting 
comprises high pressure liquid chromatography. 

7 . The method of claim 1 wherein the desalting 
10 comprises molecular weight cut-off spin filtration. 

8 . The method of claim 1 wherein the desalting 
comprises alcohol precipitation. 

9 . The method of claim 1 wherein said selected 
genetic disease or disorder is benign neonatal 

15 familial convulsions. 

10. The method of claim 9 wherein said 
amplifying comprises polymerase- catalyzed extension of 
primers having sequences represented as SEQ ID NO: 11 
and SEQ ID NO: 12 . 

20 11. The method of claim 1 wherein said selected 

genetic disease or disorder is attenuated polyposis 
coli . 

12. The method of claim 11 wherein said 
amplifying comprises polymerase-catalyzed extension of 
25 primers having sequences represented as SEQ ID NO : 13 

and SEQ ID NO: 14. 



13 ♦ A method of detecting heterozygosity at a 
polymorphic site in the genome of an individual, 
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wherein said polymorphic site comprises a site at 
which a base substitution or base deletion or 
insertion occurs in a mutant allele as compared to a 
wild type allele, comprising the steps of: 

(a) collecting a genomic DNA sample from said 
individual / 

(b) amplifying a selected portion of said 
genomic DNA sample by polymerase chain reaction to 
produce first and second amplified DNA segments 
representing corresponding portions of both homologous 
chromosomes containing said polymorphic site, wherein 
said first and second amplified DNA segments each have 
a size such that a difference in mass between said 
first and second amplified DNA segments can be 
resolved by electrospray ionization mass spectrometry; 

(c) desalting said first and second amplified 
DNA segments to produce respective first and second 
desalted amplified DNA segments; 

(d) determining the masses of said first and 
second desalted amplified DNA segments by electrospray 
ionization mass spectrometry; and 

(e) comparing said masses of said first and 
second desalted amplified DNA segments, wherein a 
difference in said masses indicates heterozygosity at 
said polymorphic site. 

14 . The method of claim 13 wherein said 
electrospray ionization mass spectrometry comprises 
ion detection with a quadrupole detector. 

15 . The method of claim 13 wherein said 
electrospray ionization mass spectrometry comprises 
ion detection with a Fourier transform ion cyclotron 
resonance mass detector. 
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16. The method of claim 13 wherein the size of 
the first and second amplified DNA segments is no 
greater than about 55 base pairs, 

17. The method of claim 16 wherein the size of 
5 the first and second amplified DNA segments is no 

greater than about 45 base pairs. 

18 . The method of claim 13 wherein the desalting 
comprises high pressure liquid chromatography. 

19. The method of claim 13 wherein the desalting 
10 comprises molecular weight cut-off spin filtration. 

20. The method of claim 13 wherein the desalting 
comprises alcohol precipitation. 



21. A method of detecting a polymorphism at a 
polymorphic site comprising a base substitution or a 
15 base deletion or insertion, comprising the steps of: 

(a) collecting a DNA sample to be tested; 
<b) amplifying a selected portion of said DNA 
sample by polymerase chain reaction to produce an 
amplified DNA segment containing said polymorphic site 
20 and having a size such that a difference in mass 

between said amplified DNA segment and a corresponding 
portion of a reference DNA sample can be resolved by 
electrospray ionization mass spectrometry; 

(c) desalting said amplified DNA segment to 
25 produce a desalted amplified DNA segment; 

(d) determining the mass of said desalted 
amplified DNA segment by electrospray ionization mass 
spectrometry; and 

(e) comparing said mass to a reference mass 
30 determined for said corresponding portion of said 

reference DNA sample, wherein a difference between 
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said mass and said reference mass indicates the 
presence of the polymorphism. 

22. The method of claim 21 wherein said 
electrospray ionization mass spectrometry comprises 
ion detection with a quadrupole detector. 

23. The method of claim 21 wherein said 
electrospray ionization mass spectrometry comprises 
ion detection with a Fourier transform ion cyclotron 
resonance mass detector. 

24. The method of claim 21 wherein the size of 
the amplified DNA segment is no greater than about 55 
base pairs. 

25 . The method of claim 24 wherein the size of 
the amplified DNA segment is no greater than about 45 
base pairs . 

26. The method of claim 21 wherein the desalting 
comprises high pressure liquid chromatography. 

27 . The method of claim 21 wherein the desalting 
comprises molecular weight cut-off spin filtration. 

28. The method of claim 21 wherein the desalting 
comprises alcohol precipitation. 
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